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§1.  Introduction 

Let  Z^,  t = 0,  ±1,..-,  be  a stationary  stochastic  process 
and  define  the  clipped  process  by 

fl,  Z > 0 

X , t = 0,  ±1,...  (1.1) 

1^0,  Z^  < 0 

There  are  several  reasons  for  the  transformation  (1.1)  commonly 
known  as  clipping  or  hard  limiting.  If  long  records  of  data  are 
available  then  it  is  convenient  to  clip  the  original  data  and 
store  the  observed  information  in  just  a few  sufficient  tics 

whose  computation  is  extremely  fast  as  only  counting  is  in^  ^ved. 
All  the  more  so,  experience  has  shown,  [6],  [7],  [10],  D.1] , [13], 

[18]  that  very  little  precision  in  estimation  is  lost  due  to  this 
coarse  quantization.  Also,  there  are  situations  [9]  when  the 
data  must  be  actually  observed  in  the  form  (1.1)  as  to  allow 
certain  noise  modulation.  Evidently  the  problem  of  outliers  in 
the  Z^  data  ceases  to  exist. 

Prom  a statistical  viewpoint  (1.1)  is  tractable  because: 
if  the  binary  process  X^  is  finitely  dependent  then  the  like- 
lihood function  of  an  observed  time  series  X, , . . . ,X  is  always 

I n 

available  regardless  of  the  probability  law  which  governs  the 
original  process  . This  means  that  efficient  estimation  of 

parameters  based  on  the  quantized  data  is  possible. 


1' 


Suppose  now  is  an  autoregressive  process  of  order  p 

1 ^ '^2^t  2^*'*^^p^t-p  ^ ^ ” 0,±1,...,  (1.2) 

2 

where  the  u^  are  independent  N(0,a  ) variates,  and  where  the 
roots  of  the  associated  polynomial  1 - - . . . lie  out- 

side the  unit  circle.  In  this  paper  we  shall  obtain  approximate  maxi- 
mum likelihood  estimates  for  <#>,,..., <<>  from  the  clipped  process 

1 P 

X^,  when  is  a stationary  p'th_order  Markov  chain.  The  re- 

lated problems  of  estimating  the  covariance  function  and  spectral 
density  of  Z^  from  the  clipped  data  were  discussed  in  [6],  CllJ , 

[13] , [17],  [is].  The  main  difference  between  these  works  and  the 
present  one  is  that  we  base  our  estimation  on  the  likelihood 
function  of  the  0-1  data  where  the  sufficient  statistics  are 
determined  by  the  axis -crossings  by  Z^.  In  other  words  the 
estimates  are  functions  of  the  particular  arrangements  and  lengths 
of  the  1-runs  and  0-runs  as  formed  by  Z^  while  crossing  level 
0.  principle  will  become  clear  as  we  proceed.  It  is  shown 

that  the  clipped  data  estimates  are  obtained  at  a substantial  sav- 
ing in  computing  as  expressed  in  terms  of  the  number  of  arithmetical 
operations.  At  the  same  time  they  compete  well  with  the  usual  maxi- 
mum likelihood  estimates. 


Exparience  has  shown  that  (1.2)  is  an  adequate  representa- 
tion for  describing  a wide  range  of  stationary  time  series  e.g. 
see  [3],  [1*0.  Maximum  likelihood  estimation  of  the  parameters 
in  (1.2)  based  on  the  original  data  has  been  discussed  in 

D.2] . Moderate  modifications  are  considered  in  Cl],  [3].  It 
turns  out  that  the  maximum  likelihood  estimates  of  the  ob- 


obtained  from  the  original  data  are  essentially  the  same  as  the 
least  squares  estimates. 


In  what  follows  we  shall  make  use  of  a famous  formula.  Let 
(X,Y)  have  a bivariate  normal  distribution  with  parameters 
EX  = EY  = 0,  Var  X = Var  Y = and  correlation  P.  Then  by 

introducing  polar  coordinates  one  obtains 

Pr(X  ^0,  ''>0)=-p  + -;^  sin  ^(p).  (1.3] 

4 ztT 

This  formula  can  be  traced  back  to  the  writings  of  Sheppard  [15] 
Stieltjes  D.6]  . 

§2 . The  Likelihood  Function  of  the  Clipped  Time  Series  and  Esti- 
mation of  Transition  Probabilities 


If  X is  a Bernoulli  random  variable  with  a probability  of 
success  0 then  Pr(X  = x)  = e’^d  - 9)^"*,  x = 0,1-  In  deriving 
this  exponential  distribution,  use  was  made  of  the  fact  that  X 
can  take  the  values  0 or  1.  This  observation  is  instrumental 
in  constructing  the  likelihood  function  of  the  clipped  series 

"f 

Define 


Pr(X^  = Xj  I X^  j^  = x^_^,. . . ,Xj  j^  = Xj  = P 


*i’‘i_l'-*i_k 


and  observe  that  from  (1.1)  we  have 


Pr(Xi  = Xi)  = i, 

= = [Pn<i-Pu>  J J 


Pr(X^  = I X 


0-Xi  1-x  i-l  rx  1-x 

Pillion  J ’KoiPooi  J J 

V 1^110^010  J 1^100^000  J J • 


t 


I 


i 


etc.,  diid  tlierefot’e  in  general 

= 7 

such  that  the  second  product  is  over  all  2^  0-1  i-tuples 
(y^.y.  j^,...,y^)  and 


X.  if  y.  = 1 
1 1 


1-x . if  y . = 0 , 
1 1 


(2.1) 


This  is  the  joint  distribution  of  any  0-1  time  series  X,,...,X 

^ I n 

provided  Pr(Xj^  = Xj^)  = —• 

Evidently,  since  , the  original  process,  is  strictly 
stationary  also  the  clipped  process  is  strictly  stationary;  that 
is,  the  stationary  property  was  preserved  by  X^ . But  Z^  is 
also  Markovian.  However,  mathematically  speaking,  X^  is  not 
a Markov  process  as  well  because  for  one  thing,  the  hard  limiting 
transformation  is  not  1-1.  In  this  case  it  is  apparent  that  (2.1) 
viewed  as  a likelihood  function  is  useless,  for  as  the  series 
size  n Increases  also  the  number  of  unknown  parameters  in- 
creases so  that  (2.1)  cannot  be  used  for  estimation  purposes. 

To  get  around  this  difficulty  one  has  to  condense  somehow  (2.1). 

We  shall  then  overlook  the  mathematical  difficulty  and  assume 
that  the  p^^  order  Markov  property  is  inherent  in  the  clipped 

process.  This  is  a statistical  assumption  which  is  certainly 

2 

reasonable  when  a is  small.  This  assumption  is  analogous  to 
the  normal  assumption  in  the  general  linear  model  or  to  the 
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assumption  of  Poisson  arrivals  in  queuing  theory  for  it  enables 
us  to  pursue  statistical  analysis  based  on  the  likelihood  (?.l). 

Now  let  7.^,  t = 1,2,. ..,n,  be  given  by  (1.2),  and  , 
t = 1,2,. ..,n,  be  given  by  (1.1).  Under  the  assumption  that 
is  a order  Markov  process  the  joint  distribution  (2.1)  re- 

duces to 


Pr(Xj^ 


X ) 

n 


n 1. 1. 

« n n p ^ 

i=p+l  ^i^i-l 


(2.2) 


= n 


n 

I 1 • 

i=p+l  " 
'yi^i-r-yi-i 


.1. 

1-p 


where  as  before  the  product  in  the  last  expression  extends  over 
all  the  2^^^  0-1  arrangements  l’’"'’^!  p^'  (2.2) 

terms  which  do  not  depend  on  n were  ignored.  This  likelihood 
function  is  a special  case  of  the  one  given  by  Bartlett  (1951), 
except  that  formally  we  utilize  it  in  a different  way.  We  shall 
make  use  of  the  following  identity; 

1 = (1-z)  + z = (l-y)(l-z)  + y(l-z)  + (l-y)z  + yz 

= (l-x)(l-y)(l-z)  + x(l-y)(l-z)  + (l-x)y(l-z)  + xy(l-z)  (2.3) 
+ (l-x)(l-y)z  + x(l-y)z  t {l-x)yz  + xyz. 


= Pr(X^  = 1 I X^_j^  *1) , k = 1,2  ....  ,p. 


and  so  on.  Define 
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We  estimate  the  <ii  from  the  X . It  is  possible  to  rewrite 

K K 

(2.2)  as  a product  of  a function  in  Xj^  and  a function  of  nui- 
sance parameters  in  this  case  higher  order  probabilities.  The 
function  in  X is  a power  of  X times  a power  of  1 - X . 
Only  terms  in  (2.2)  of  the  form 


y I....X.  - .X,  ^i.  . ...X. 

1 1-p  + k + l i-p  + k i-p  + k-1  i-p  + 1 i-p 

’y....y.  , .ly.  , .•••y.  ,1 

■^1  i-p  + k + 1 •'i-p  + k-l  i-p  + 1 


(2,4) 


or  of  the  same  form  but  with  the  x's  replaced  by  (1-x. 
and  (1-x^  p)  respectively,  and  the  I's  replaced  by  O's, 
contribute  their  exponents  to  the  power  of  X^^ . The  rest  of  the 
terms  in  (2.2)  contribute  their  exponents  to  (1  - ) • The  sum 

of  all  these  exponents  in  n-p.  Thus  the  exponent  of  X^^  is 


y cyx,...x,  . ,...x,  ,3x,  .1,^. 

^ 1 i-p  + k + l 1-p  + k-l  i-p  + 1 i-p  + k i-p 


+ y [yx....x,  X,  .,...x.  ,](i-x,  ,)(i-x.  ), 

^ 1 i-p+k+1  i-p+k-1  i-p+1  i-p+k  i-p 


where  the  second  sum  extends  over  all  possible  terms  of  the 
form  (2.4).  Evidently,  this  second  sum  is  just  the  telescopic 
identity  (2.3)  and  is  equal  to  1.  The  exponent  of  1 - is 

obtained  iu  the  same  way.  (2.2)  can  now  be  rewritten  as 


2 f X,  ,x.  - y (x.  ^,+x.  )+(n-p) 

i-P+k  i-P  ,_L-,  i-P+k  i-p 


Pr(X^  = x^,. . . ,X^  = x^)«  i=p+l  i=p+l 


(2.5) 


■2  y X.  „^,x.  + y (x.  ^.+x.  ) 

i=p+i  i=p+i 


•powers  of  nuisance  parameters. 


It  follows  that  a maximum  likelihood  estimate  of  is  given 


A {-^.1  ^ ,+x.  ) + (n-p) 

i-p+1  i-p+k  i-p  1 i-p+k  i-p 

\ il£ii 


, k = 1,2,. . . ,p.  (2.6) 


We  = Xj^  and  for  a sufficiently  large  n [2]  Xj^  is 

asymptotically  normal  such  that  Var  \/n(X,  -X,  ) X,  (1-X,  ). 

k K k k 

From  a computational  viewpoint  it  is  preferable  to  express 
in  the  simpler  form 


^ 2Rj^  - 2S  + n 

X,  - , k = 1,2,. ..,p. 


(2.7) 


n n 

where  Ri,  = I X.X,  , , S = y X..  This  unbalsed  form  is 
i=k+l  1=1 


essentially  the  same  as  (2.6)  except  for  end  effects  of  the  series 
which  become  negligible  as  n increases.  We  shall  use  (2.7). 

In  the  next  section  we  show  how  to  obtain  approximate  maximum  like- 
lihood estimates  for  the  from  the  . 

Wow  we  Immediately  see  that  depends  on  the  axis  cross- 

ings by  Z^.  In  particular 
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? _ n-2(#  of  1-runs) 

’ n-1 

or,  neglecting  end  effects  as  is  the  case  for  large  n 


n-1 


(2.8) 


n - (#  of  axis  crossings  by  ) ^2  g) 


n-2(#  of  runs  between  the  first  and  last  1 

? with  two  or  more  symbols  1 1 ) 

\ ■ n-2 


(2,10) 


etc . 


The  exact  distribution  of  the  number  of  axis  crossings  by 
when  is  either  a first  or  second  order  Markov  chain  has 

been  recently  obtained  in  [8]. 

It  should  be  noted  that  the  estimation  problem  of  transition 
probabilities  treated  here  is  somewhat  different  than  the  usual 
problem  of  estimation  of  conditional  probabilities  in  p^^  order 
Markov  chains  where  the  quantities  of  interest  are  the  probabili- 
ties of  a state  given  the  previous  p states.  That  problem  was 
first  solved  in  [2]. 


§3 . Estimation  of  the  ((>*s 

We  start  with  a brief  review  of  the  maximum  likelihood  esti- 
mation of  A, »•••»♦  from  the  original  Z series.  This  will 
^1  p 

give  rise  to  natural  estimates  based  on  the  clipped  data. 

Given  a time  series  z,,...,z  from  the  Gaussian  process 

I n 

Z then  its  joint  distribution  is  N(0,r  ),  where  = iY,.  . 

t ~ 'n  n |i-j  1 

i,j  = I,...,n,  and  the  joint  density  can  be  written  as  the 
product 
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f(z^,.  . . = f(Zp^^,.  ..  ,2^|^,a^,z^,.  . . , z^  ) f ( z^  , . . . , z^  |£,  ) , 

Because  the  second  term  on  the  right  hand  side  is  independent  of 
n the  log-likelihood  for  sufficiently  large  n can  be  conveniently 
defined  as 


2 

e(^)  = log  f ( Zp^j^  , . . . , Z^  1^,  CT  ,Z^ 


y • • • y Z ) 


= - ^ log2na^ Ky  (1,-^') 


2a 


Sz 


-22tS 


■Zz/z^ 

~t  t 


(3.1) 


where 


^ . . . ,<j>p)',  ^ . . . ,z^_p)  ' , and  the  sums 

extend  from  p+1  to  n.  Then 


2 2 z^z^  - 22  zzli} 

^ 2a  ^ ~t  t ^~t 

and  by  equating  to  0 we  obtain  the  maximum  likelihood  estimator 

, -1 


<t>  = (2z  2 ) Zz  Z 

~tM:  ~t  t 


(3.2) 


2 ,„_2 


Now,  multiply  (3.2)  by  1 = 2z^/2z^  and  define 


r 1.1,  -1/  ^^t^t-k  X 

= - t J.  sin  ( --y-  )■ 


Then  apart  from  negllblble  end  effects  (3.2)  takes  the  form 


1 sinn  (l^-%) 

• 

...  sintr(l.  ,-*s) 

-1 

slnff(\  -*s) 

P-1 

1 

sinn(Xj^-*i)  1 

• • 

• • 

...  sinn(X  .-%) 
P-2 

• 

• 

8inTr(X2-*») 

siniK^  ,-S)  8inn(X  --Jj) 
L P-1  P-2 

...  1 

mt 

sinnOi  ->j) 

P 

(3.3) 
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where  X = ( X , . . . , X ) ' , It  follows  that  the  approximate  maxi- 
~ 1 p 

mum  likelihood  estimator  based  on  the  clipped  data  is 


(3.4) 


where  X,  is  given  in  (2.7)  and  X = ( X, , . . . , X )'.  It  is  possible 
k ~ 1 p 

A 

to  calculate  ^(^)  recursively  using  formulae  developed  in  [5], 
This  is  equivalent  to  finding  the  inverse  in  (3.3). 

From  (3.3)  and  (3.4)  it  is  seen  that  the  estimation  problem 
of  (6  was  reduced  to  the  estimation  of  X and  that  the  loss  of 

A 

efficiency  due  to  clipping  depends  on  the  closeness  oJ.  ^ to  \, 
Obviously  when  \ and  X are  close  so  are  ^ and  ^ by  con- 
tinuity. It  is  convenient  to  introduce  the  shift  operator  B 
defined  by  BZ^  = , and  let  4>(B)  = 1 - <t>  . Also 

let  p,  be  the  correlation  function  of  Z^  and  let  ic  (i,j,k) 
k t X 

be  the  fourth  order  cumulant  function  of  (obviously  it  corre- 

sponds to  stationar ity ) . 


K j ^l<) 


-ir-{sin“^(  p.  )sin"^(  p.  ,) 
4Tr^  ^ 


+ sin“^(p^)sin  + sin  ^(pj^)sin  ^(p^_j)}. 


Because  Z is  Gaussian  < vanishes.  Now  K is  a measure 

Xu  X 

of  dependence  in  the  clipped  series  and  since  the  pairwise  depen- 
dence in  this  series  is  weaker  than  the  pairwise  dependence  in 
the  original  Gaussian  process  (see  (4.1)  and  the  remarks  following 
It)  it  is  reasonable  to  assume  that  is  small  and  summable. 
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Thoofom  1:  Let  be  an  AR(p)  procenr. 

2 

are  independent  N(0,ct  ) random  variables,  where  the  roots 
of  <<i(B)  = 0 lie  outside  the  unit  circle.  Let  be  given  by 

(1.1).  If  I 2 (k,-r,j-r)|  < * then  £ converges  to  ^ in 

r = --* 

mean  square  in  the  sense  that 

lim  E||£  - ^11  = 0, 
n -*■“ 

where  |1  • ||  denotes  the  usual  norm. 


Proof : With  obvious  notation  write 

4 = 4(X)  = M ^ p . 


Let 


and 


12 


r 


Then 


= ‘K  ♦ £>■*<£♦')• 

Observe  that  I | < 2j3in  j (Xj^-Xj^)|.  It  follows  from  the  fact 
that  is  absolutely  summable,  EX  = X,  and  [1]  p.  464, 


' ^2  ^(p^)sin  + sin’^(p^_^)sin"^(p^^j^ 


)] 


+ 4 2 »'^(k,-r,j-r) 

r=-« 


(3.5), 


that  E e = 0{l/n).  Thus,  for  sufficiently  large  n we  can  use 
results  from  perturbation  theory.  Accordingly  [4]  we  add  another 
equation 


-1. 


^0  = ^ 

cud  define  the  norm  of  a p x p matrix  A by 


I A II  = 


maximum  H -Av  H 

liY.ll 


We  shall  find  a bound  on 


V € E* 


t-t 


Note  that 


•'t  - til  - ll£  - toll  + llto  - til 


As  n increases  ||M  ^£||  = 6 = O^d/Vn)  becomes  small,  so  that 
for  sufficiently  large  n it  follows  [4]  by  the  perturbation 
theorem  that 


II*  - *oll  ® l'!S'^*ll  ll*oll 
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while 


and  Ik, 


5 (lM“^e||  + 11^  II 

rK/  r-j 


Thus 


I*  - = (IIJM'^ell  + Ik  II ) 11 M liny  + llM‘-'llllell. 


and  observe  that  by  the  Frobenius  norm  estimation 

eIIEII^  S Z Ee^i,  . I = 0(l/n). 
i#j 

Theorem  2 : Under  the  same  conditions  as  in  Theorem  1 the  approxi- 

^ 2 “ 1 

mate  asymptotic  distribution  of  £ is  N((^,  a /n  F^  ). 

Proof;  Observe  that  EX  = X + 0(l/n).  From  [1]  p.  489  and  using 

■ - f\j  rsj 

a Taylor  series  expansion  to  one  term 


lim  n Cov(\  ,X. ) 

n-..  3 


^ Co  P , .+P  .P  ,+2p.p,  P -2p.p  p , -2p,  p p .], 


n^(l  - p^)^^^(l  - 


(3.6) 


from  which  it  follows  that 


0 , n 


Hence 


plim^  = X and  by  the  fact  that  is  continuous  also 

plim^  = From  this  and  Theorem  1 then 

n-^ 

plim  (♦“£)  = 


Therefore  it  is  sufficient  to  consider  the  asymptotic  distribution 
of  ^ only.  Since  it  is  well  known  that  £ has  an  asymptotic 
normal  distribution  we  only  sketch  the  rest  of  the  proof  for  con- 
venience. The  asymptotic  covariance  matrix  can  be  obtained  from 
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(3.1)  by  noting  that  the  ij  element  of  the  information  matrix  ii 


^d<t> 


ind  so  Var(^)  an  — — T . Observe  that 
^ n-p  p 


Vn(t-A)  = ,2.1  ,V.t’ 


and  this  has  the  same  limiting  distribution  as  = l/V^nr^  t=p  + l~t*^t 

Anderson  [1]  pp.  198-200  has  shown  that  if  £ is  any  vector  of 
constants  then  c'y  has  an  asymptotic  normal  distribution.  It 
follows  that  — *■  N(0,a  F ). 

When  Yq»  the  variance  of  , is  known  then  an  approximate 

2 

maximum  likelihood  estimator  of  a is. 


o^  = 2 $.sinn(X  -%)] 

0 k = l 


(3.7) 


However  when  is  unknown  it  is  impossible  to  estimate  it 

from  the  clipped  data  since  the  variability  in  the  Z^  process 

is  not  preserved  in  the  axis  crossings.  This  means  that  there  is 

2 

no  connection  between  X,  and  o . In  order  to  be  able  to  estimate 

k 

2 2 

a the  quantity  JCz^  is  needed  in  addition  to  the  clipped  data. 

We  consider  now  two  important  special  cases. 


Example  3.1.  Estimation  In  AR(1). 


Assume  Z^  = ^l^t  1 * '*t  * likelihood  (2.2) 

reduces  to 


L(X^)  = 1/2(1  - 


(3.8) 
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where  d = 2s  - 2r^  - (>^2^+  ) is  the  number  of  axis  crossings 

by  . Owing  to  (2.9),  (3.4)  becomes 


= <<)  (X  ) 
1 


= s in  n {. 


(n-1)  - (it  of  axis  crossings)  JL j (39) 


This  estimator  has  been  studied  in  [7]  where  it  was  shown  that  it 
competes  well  with  the  estimate  obtained  from  the  original 

data.  behaves  remarkably  well  even  for  | (<)^  ( close  to  1. 

Now  from  (3.8)  and  the  uniqueness  of  the  binomial  expansion  we 
immediately  obtain  (see  [8]  for  another  proof) 

D ~b(n-l,  1-X^) . (3.10) 

That  is,  the  distribution  of  the  number  of  axis  crossings  D under 
the  assumption  that  is  a first  order  Markov  chain  is  binomial. 

However  in  [8]  it  was  experimentally  shown  that  (3.10)  is  reasonable 
only  when  is  in  the  approximate  range  ( I 2 0.6.  This  dis- 

cussion leads  to  the  interesting  observation  that  the  Markov  assump- 
tion is  suitable  for  estimation  as  is  seen  from  Theorem  1.  However 
the  distribution  (3.10)  obtained  under  this  assumption  should  be 
regarded  not  without  reservations.  This  seemingly  a paradox  can  be 
resolved  as  follows.  The  estimation  problem  is  rather  simple  since 
estimated  is  only  one  parameter  while  the  distribution  problem  is 
much  more  complex  and  may  involve  many  parameters  which  our  Markov 
assumption  excludes  from  consideration.  But  when  is  rather 

small  (3.10)  is  adequate  from  which  it  follows  directly  that 
VrTCXj^-Xj^) — ► M(0,Xj^(l-X^) ) , n -*  •»,  which  in  turn  Implies  that 

A I 2 

)— * M(0,ii  X^(l-Xj^)),  n Thus,  to  test  the  hypothesis 


16 


that  is  white  noise,  i.e.  ^o'^l  ” reject  for  large 

values  of  Vn 

The  main  advantage  of  is  that  its  computation  is  fast 

and  can  be  carried  on  a small  calculator.  Let  us  compare  the  ex- 
pected number  of  arithmetical  operations  needed  in  order  to  cal- 
culate  vs.  the  number  needed  for  . Neglecting  end  effects 

we  have : 


Expected  # of  arithmetical  operations  needed  for 
Expected  ^ of  arithmetical  operations  needed  for 


*1«^  j(l-Xj^)n  + 6 
w 4n, 


where  taking  the  sine  and  counting  a 1— run  are  considered  as  single 
operations. 


Example  3.2;  Estimation  in  AR(2) 

Here  (1.2)  becomes 

^t  ~ *l^t-l  ^2^t-2  “t 
The  estimates  (3.4)  are 


sin[n(  X^-!j)3[l  - sinn(X2->i)] 


Cos  [it(Xj^-Jj)3 


* 8inn(X2-Jj)  - sin^[ff{X^-J,)D 


Co8^[n(Xj^-ls)] 


(3.11) 


Note 


that  1^2 1 ^ ^2  ^ ^2  ” ^ ^ needed  for 


to  be  stationary. 
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It  follows  from  (2.8),  (2.10)  that  and  are  functions 

of  the  number  of  axis  crossings  and  the  number  of  runs  with  two 

or  more  symbols  in  ti.  0-1  series.  We  investigate  here  via  an 

extensive  simulation  a ’ase  suggested  by  Wold  (1965)  and  studied 

2 

in  tl]  section  5.9.  Accordingly  » 

2 6 2 

o = (1  - )/(l  + <<>  ) so  that  VarZ^  = 1.  The  model  is  then 

^t’ 

The  values  of  are  0.25,  0.7,  0.9  and  of  n 250,  500,  1000, 

2000.  For  each  pair  (<<),n)  ten  time  series  were  generated  from 
which  the  sample  averages  and  variances  of  , <t>^  and  of  , 

$*2  were  computed.  The  results  are  given  in  Table  2.  Table  1 
gives  typical  records  corresponding  to  the  three  choices  of  <f), 

A A 

It  is  seen  that  are  on  the  average  just  as  good  point 

Ay  Ay 

estimates  as  are.  However  the  variances  of  <fi^ 

A A 

are  usually  smaller  than  those  of  and  Roughly  speaking, 

in  order  to  achieve  the  same  order  of  precision  for  the  estimates 
based  on  the  clipped  data,  the  0-1  series  should  be  approximately 
twice  as  large  as  the  original  series.  Although  the  estimates 
based  on  the  original  series  are  endowed  with  a relatively  higher 
efficiency,  for  large  data  records  the  difference  between  these 
two  types  of  estimators  is  negligible.  This  is  in  accordance 
with  Theorem  2.  The  study  of  this  relative  efficiency  is  the  sub- 
ject of  the  next  section. 

What  happens  if  in  the  clipped  series  the  O's  are  replaced 
by  I's  and  the  l*s  are  replaced  by  O's?  In  this  case  the 
information  pertaining  to  axis  crossings  Is  left  intact  and  the 


same 


is  true  for  the  number  of  runs  with  at  least  two  symbols 
between  the  first  and  last  1 except  for  negligible  end  effects. 
This  means  that  we  obtain  almost  exactly  the  same  estimates  for 
*^1*  example  with  n = 1000  and  (p  ~ 0.25  we  obtained 

ave  = 0.2539,  ave  = -0.0646,  Var  = 0.0017,  Var 
0.0026  which  are  almost  the  same  as  the  corresponding  results 
in  Table  2. 


Table  1.  Typical  realizations  of  size  50 
obtained  by  clipping 


* Zt-2  ^ 


~ N(0,(1-^  )/(l+i^  )) 


# Axis- 

<b  Clipped  Series  crossings 

0.25 

OllllOlOlOOOOllOllllOOlllOllOOOOlOOllOOOlOlllOOOll 

23 

0.7 

01111000110001100111001110010000110010001111100011 

19 

0.9 

11000111000111000111000111000100000000111000111110 

15 

Again  (3.11)  can  be  computed  much  faster  than  and  4> 

In  fact  the  difference  in  the  expected  number  of  operationsis 

1 A A 

approximately  6n  - x (2  - X.,  - X ) in  favor  of  and  ^ . 


Estimat 
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§4.  Loss  of  efficiency 

A problem  associated  with  estimation  after  hard  limiting 
is  the  problem  of  measuring  the  loss  in  the  precision  of  our 
estimates  due  to  this  transformation.  That  clipping  results  in 
a loss  of  "information"  can  be  seen  from  the  inequality 


(4.1) 


2 - 1 

where  p (k)  = — sin  ( p,  ) is  the  autocorrelation  of  X . Now 

X Ti  k t 

since  in  addition  X and  X are  independent  if  and  only  if 

X L ~ K 

they  are  uncorrelated,  (4.1)  means  that  the  pairwise  dependence 
in  the  clipped  series  is  weaker  than  that  in  . Intuitively, 
this  may  affect  the  efficiency  of  estimates  computed  from  the 
clipped  data.  More  precisely,  if  (#>^(X)  is  the  component 

of  Then  upon  using  a Taylor  series 

approximation  we  obtain  when  n is  sufficiently  large 


Var  <f>  = Var  4,  (X)  w a 

U ^ O ~ ~ 


Var  (p  = Var  <t>  (X)  w a*  Z-j^a 
^0  ~ ~o  ~X~o 


(4.2) 


where  a 
~u 


( a . 


) and 


is  the  covariance  matrix  of 


/ a IIU  ^ 

1 u p o 

A A 

\.  It  is  convenient  to  associate  the  efficiency  of  ^ with  the 

sum  of  the  eigenvalues  of  Zf.  Accordingly  we  define 

~X 


def 


eff  (^) 


(4.3) 


I 
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The  choice  for  (4.3)  stems  from  the  simplicity  of  its  calcula- 
tion and  the  fact  that  if  trZ*  is  relatively  large  so  will 

A 

be  Var  too.  Using  the  dominating  terms  in  (3.5)  and  (3.6) 
we  obtain 


Var  X 

ll  « 

Var  X 

k 


(1-p?)  ^ _(sin  ^(p  ))^ 

K r= - • r 


(4,4) 


Therefore  when  is  small , e.g.  |p  | < 0,2,  k#0,  (4.3) 

is  near  unity  and  the  clipping  results  in  almost  no  loss  for 
all  practical  purposes.  In  general  we  obtain  an  approximate 
upper  bound  using  (4.1) 


ef  f (^  ) 

A./ 


2 TT  ^ 

< max  (1-P  ) -T— 
- k 4 

k— 1,, . , ,p 


(4.5) 


Because  Var  = 0(l/n)  (4.5)  means  that  for  2 to  be  as 

efficient  as  is  the  binary  series  should  be  roughly  twice 

as  long  as  the  original  series.  The  experimental  results  in 

Example  (3.2)  support  this  finding.  See  [13],  [18]  for  similar 

results.  It  should  be  noted  however  that  for  long  series  the 

difference  between  ^ and  ^ is  negligible.  Also,  observe  that 

when  p > 0 an  approximate  upper  bound  for  the  ratio  Var  ^ / 

K y 

Var  is  again  (4.5).  Finally  it  is  easy  to  see  that  the  expected 
number  of  operations  that  can  be  saved  owing  to  clipping  is  at 
least  (pt2)n  for  an  AR(p). 


} 

i 

I 
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